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24 Abstract 

25 Coronaviruses pose serious health threats to humans and other animals. 

26 Understanding the mechanisms of their replication has important implications for 

27 global health and economic stability. Nonstructural protein 9 (nsp9) is an essential 

28 RNA binding protein for coronavirus replication. However, the mechanisms of the 

29 dimerization and nucleic acid binding of nsp9 remain elusive. Here, we report four 

30 crystal structures, including wild-type porcine delta coronavirus (PDCoV) nsp9, 

31 PDCoV nsp9-AN7 (N-terminal 7 amino acids deleted), wild-type porcine epidemic 

32 diarrhea virus (PEDV) nsp9, and PEDV nsp9-C59A mutant. These structures reveal 

33 the diverse dimerization forms of coronavirus nsp9. We first find that the N-finger of 

34 nsp9 from PDCoV plays a critical role in dimerization. Meanwhile, PEDV nsp9 is 

35 distinguished by the presence of a disulfide bond in the dimer interface. Interestingly, 

36 size-exclusion chromatography and analytical ultracentrifugation analyses indicate 

37 that the PDCoV nsp9-AN7 and PEDV nsp9-C59A mutants are monomeric in solution. 

38 In addition, electrophoretic mobility shift assays and microscale thermophoresis 

39 analysis indicate that the monomeric forms of PDCoV nsp9 and PEDV nsp9 still have 

40 nucleic acid binding affinity, but it is lower than that of the wild type. Our results 

41 show that the diverse dimerization forms of coronavirus nsp9 proteins enhance their 

42 nucleic acid binding affinity. 

43 Importance 

44 Coronaviruses cause widespread respiratory, gastrointestinal, and central nervous 
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45 system diseases in humans and other animals, threatening human health and causing 

46 economic loss. Coronavirus nsp9, a member of the replication complex, is an 

47 important RNA binding subunit in the RNA-synthesizing machinery of all 

48 coronaviruses. However, the mechanisms of the dimerization and nucleic acid binding 

49 of nsp9 remain elusive. In the study we have determined the nsp9 crystal structures of 

50 PDCoV and PEDV. We first find that the N-finger of nsp9 from PDCoV plays a 

51 critical role in dimerization. Meanwhile, PEDV nsp9 is distinguished by the presence 

52 of a disulfide bond in the dimer interface. This study provides a structural and 

53 functional basis for understanding the mechanism of dimerization and show that the 

54 diverse dimerization modes of coronavirus nsp9 proteins enhance their nucleic acid 

55 binding affinity. Importantly, these findings may provide a new insight for antiviral 

56 drug development. 

57 Introduction 

58 Coronaviruses (CoVs) are enveloped viruses, with single-stranded RNA genomes 

59 of positive polarity approximately 30 kb in length, that can be divided into four 

60 genera: a-CoV, |3-CoV, y-CoV and 5-CoV (1, 2). The ideal hosts of a-CoV and P-CoV 

61 are mammals, and y-CoV primarily infects birds, while 5-CoV has been identified in 

62 both mammals and birds (3). In the past 15 years, there have been two epidemics of 

63 fatal pneumonia in addition to other outbreaks caused by CoVs. The severe acute 

64 respiratory syndrome CoV (SARS-CoV) emerged in China in 2002, and the Middle 

65 East respiratory syndrome CoV (MERS-CoV) emerged in Saudi Arabia in 2012 (4-6). 
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66 In addition, the PEDV and the PDCoV, two re-emerging and emerging epizootic 

67 swine viruses, have caused major economic losses in Asia and the United States (7-9). 

68 Co Vs encode the ORF la and ORF lab polyproteins, which are then processed 

69 into 15-16 nonstructural proteins (nsps) by two virus-encoded proteinases, the 

70 papain-like and 3C-like proteinases (10-14). All these nsps, except for nspl and nsp2, 

71 are considered essential for transcription, replication and translation of the viral RNA 

72 (15-17). Nsp9 with nsp7, nsp8 and nsplO localizes within the replication complex and 

73 is likely a member of the replication complex (18). The deletion of nsp9 in MHV 

74 prevents RNA synthesis and productive virus infection, while fusion of a nsp9-10 

75 oligoprotein results in viability but attenuated growth, indicating that the mature form 

76 of the nsp9 protein is critical for viral replication (19). Another study showed that 

77 nsp9 is a nucleic acid binding protein that is essential for replication (20). Meanwhile, 

78 the dimerization of nsp9 is critical for viral replication (21). For nsp9, there are five 

79 crystallographic structures that showed a variety of dimeric interfaces (22-25). 

80 However, the dimerization mechanism of all CoV nsp9s, whether from emerging 

81 viruses or not, is not so clear yet. 

82 In this study, we elucidated the crystal structures and functions of PDCoV nsp9 

83 and PEDV nsp9. The critical amino acids or motifs for nsp9 dimerization and nucleic 

84 acid binding were further investigated. 

85 Results 

86 Overall structure of the nsp9 
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87 The PDCoV nsp9 and PEDV nsp9 structures are refined to 1.80 A and 2.89 A 

88 resolution, respectively (Table 1). Their space groups are P2i and P64, respectively. 

89 With the exception of the regions encompassing amino acids 72-79, and 106-109 of 

90 PDCoV nsp9 and 1-6, 33, 55-58, and 106-108 of PEDV nsp9, all residues of both 

91 nsp9 proteins can be built in the final models (Fig. 1A, IB). The crystal structures of 

92 PDCoV nsp9 and PEDV nsp9 reveal that their monomers contain seven antiparallel 

93 P-strands and one a-helix appended at the C-terminus of the polypeptide chain 

94 (residues 91-103), as well as loops connecting the P-strands (Fig. 1A, IB). All seven 

95 P-strands form a P-sandwich, with sheet pi-3 interleaved between sheets P4-5 and 

96 P6-7 (Fig. 1C). 

97 The N-finger plays an important role in the dimerization of PDCoV nsp9 

98 Based on the PDCoV nsp9 crystal structure, the dimer interface is formed by the 

99 N-fingers and the parallel association of the C-terminal a-helices. A total surface area 

100 of 924.3 A 2 is buried upon dimerization (Fig. 2A). Although the N-terminus of nsp9 

101 may be flexible in other coronaviruses, we see clear electron density for this region in 

102 the 2Fo - Fc map (Fig. 2B). One hydrogen bond forms between the Feu4 main chain 

103 of subunit B and the Arg70 main chain of subunit A, and one hydrogen bond forms 

104 between the Arg7 main chain of subunit A and the Serl03 side chain of subunit B (Fig. 

105 2C, 2D). Feu4 and Arg7, which form part of the N-finger (Fig. ID), clip onto the edge 

106 of the inner P-sheet (Arg70 on strand 6) and the edge of the a-helix (Serl03) from its 

107 dimer partner (Fig. 2C, 2D). Further stabilization is derived from the close packing of 
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108 the helices from the two monomers because the heart of the dimer interface consists 

109 of two glycines (Gly96, GlylOO), which form a hydrophobic interface and stabilize 

110 the dimer (Fig. 2C, 2D). 

111 To determine whether the PDCoV nsp9 dimer found in the crystal is also present in 

112 solution, we performed size-exclusion chromatography (SEC) and analytical 

113 ultracentrifugation (AUC) experiments to confirm the oligomeric state of PDCoV 

114 nsp9 in solution. Our data show that PDCoV nsp9 eluted in two peaks with calculated 

115 molecular masses of approximately 12.64 kDa and 25.33 kDa, corresponding to a 

116 monomer and a dimer, respectively (Fig. 2E). Meanwhile, the AUC data, which are 

117 consistent with the SEC results, show that the PDCoV nsp9 protein existed in both 

118 monomeric and dimeric states in solution (Fig. 2F). 

119 Then, we determine whether the N-finger plays important roles in nsp9 dimer 

120 formation. The nsp9-AN7 protein was prepared, and SEC and AUC assays were used 

121 to analyze the oligomeric states of the two mutants. The SEC data show that the nsp9- 

122 AN7 mutant eluted in only one peak, which corresponded to a monomer (Fig. 2G, 2H). 

123 Meanwhile, the nsp9-L4A/L6A/R7A/N8A mutant has a monomeric form (Fig. 2G 

124 2H). The AUC results reveal a monomeric state of the nsp9-AN7 mutant with a 

125 calculated molecular weight of 10.94 kDa, a sedimentation coefficient (S 2 o,w) of 1.349, 

126 and a frictional ratio (f/fo ) of 1.317 (Fig. 2F). In addition, PDCoV nsp9-AN7 is 

127 crystallized in the space group P2,. The structure is determined by molecular 

128 replacement using the structure of wild-type PDCoV nsp9 as the search template and 

129 is refined at 1.99 A resolution (Table 1). The structure of PDCoV nsp9-AN7 remains 
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130 basically consistent with wild-type PDCoV nsp9 except for the N-finger. The 

131 structural superposition of the wild-type nsp9 monomer with the nsp9-AN7 mutant 

132 monomer shows a root mean square deviation (RMSD) of 0.9 A between the 90 Ca 

133 atoms (Fig. 3). According to these results, the N-finger plays an important role in 

134 maintaining the dimer stability. 

135 Meanwhile, the SEC data show that the nsp9-G96E and nsp9-G100E mutants have 

136 a monomeric form (data not shown), indicating that the Gly96 and GlylOO of 

137 GXXXG motif can also impact the dimerization. 

138 The importance of the dimerization-related amino acids in nucleic acid binding of 

139 PDCoV nsp9 

140 Seven mutants, including nsp9-L4A, nsp9-L6A, nsp9-R7A, nsp9-N8A, 

141 nsp9-L4A/L6A/R7A/N8A, nsp9-G96E and nsp9-G100E, were designed to test 

142 whether the N-finger and the glycines in the a-helix of PDCoV nsp9 are related to 

143 nucleic acid binding. Then, two EMSA assays were performed to examine the nucleic 

144 acid binding affinity of each mutant, and microscale thermophoresis (MST) assays 

145 were used to determine the quantitative value of the binding affinity. The results 

146 reveal that Leu4 and Leu6 are not important amino acids for nucleic acid binding. The 

147 wild-type PDCoV nsp9 protein bound to ssDNA with a Kd of 410 pM, and the ssDNA 

148 binding affinities of the nsp9-R7A, nsp9-G96E and nsp9-G100E mutants are 

149 2.5~5-fold weaker than that of wild-type PDCoV nsp9. In addition, 

150 nsp9-L4A/L6A/R7A/N8A shows almost no binding affinity (Fig. 4A, 4B, 4C). The 

151 results of the EMSA and MST assays indicate that Arg7 in the N-finger and the 
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152 glycines in the a-helix of PDCoV nsp9 play important roles in nucleic acid binding. 

153 Based on the sequence alignment, the N-finger motif and GXXXG motif of the 

154 protein are highly conserved (Fig. ID). To determine whether these motifs are 

155 actually critical for the dimerization of other re-emerged coronavirus nsp9s, the 

156 PED V nsp9 should be further investigated. 

157 The disulfide bond is critical for the dimerization of PEDV nsp9 

158 Two potential dimer interfaces are observed in the PEDV nsp9 crystal. One of the 

159 dimers observed in the crystals is formed by the parallel association of the C-ternrinal 

160 a-helices (Fig. 5A). The helices from the two monomers pack together closely 

161 because the heart of the dimer interface consists of three glycines (Gly95, Gly99 and 

162 Glyl02), which form a hydrophobic interface and stabilize the dimer. The a-helix 

163 atoms of the two Gly95 residues show the closest distance of 3.3 A, and the distance 

164 between the two Glyl02 residues is 10.3 A, with a very weak hydrophobic interaction 

165 (Fig. 5B). Compared with PDCoV nsp9, the N-finger of PEDV nsp9 may be flexible 

166 and cannot be built in the model. 

167 Another dimer of PEDV nsp9 forms a disulfide-linked homodimer (Fig. 5C), in 

168 which the two monomers are linked by the disulfide bond formed between the Cys59 

169 residues. Two P-strands, 4 and 5, are also involved in dimerization through the 

170 formation of two hydrogen bonds between the Cys59 side chain and the main chain of 

171 residue 61, two H-bonds between the Asn60 main chain and the side chain of residue 

172 61, and hydrophobic interactions among residues Lys52, Gly58, Cys59, Asn60 and 
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173 Thr61 (Fig. 5D). 

174 We applied biochemical techniques to determine the oligomeric state of PEDV 

175 nsp9 in solution. We performed AUC and confirmed the monomeric and dimeric 

176 states of PEDV nsp9, with calculated molecular weights of 14.22 kDa and 20.92 kDa, 

177 respectively. The sedimentation coefficients (S 2 o,w) are 1-489 and 1.927, respectively, 

178 and the frictional ratio (f/f 0 ) is 1.421 (Fig. 5E). The AUC assays show that monomers 

179 and dimers are the primary forms of PEDV nsp9 protein in solution, coexisting with a 

180 small number of higher oligomers. 

181 To determine whether Cys59 plays important roles in nsp9 dimer formation, one 

182 mutant, namely, nsp9-C59A, was designed. Then, the AUC assay was used to analyze 

183 the oligomeric states of the mutant. The AUC results for the nsp9-C59A mutant reveal 

184 a monomeric state, with a calculated molecular weight of 12.2 kDa, a measured 

185 sedimentation coefficient (S 2 o,w) of 1.372, and a frictional ratio (f/fn) of 1.393 (Fig. 

186 5E). Then, PEDV nsp9 and the nsp9-C59A mutant were treated with SDS-PAGE 

187 loading buffer without DTT, a reagent that can be used to disrupt the intramolecular or 

188 intermolecular disulfide bond formed by cysteines, and SDS-PAGE was performed 

189 (Fig. 5F). We used freshly prepared PEDV nsp9 protein from E. coli to perform the 

190 AUC and SDS-PAGE which have proved the disulfide bond may be formed in E. coli 

191 and indicated that the PEDV nsp9 dimer has a disulfide bond and Cys59 is an 

192 important residue for PEDV nsp9 dimer formation. In addition, to determine whether 

193 the PEDV nsp9 protein has the disulfide bond in mammalian cells, HEK293T cells 

194 were transfected with an empty vector or wild-type PEDV nsp9 plasmids. At 48 h 
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195 post-transfection, cells were treated with R1PA lysis buffer. Then the lysates were 

196 treated with SDS-PAGE loading buffer with or without DTT and immunoblotting was 

197 performed. The results show that the PED V nsp9 is monomer in cells and indicate that 

198 the disulfide-bonded nsp9 dimer might be an artifact in the infected cells (Fig. 5G). 

199 To further validate the importance of the disulfide bond in dimer formation, we 

200 analyzed the structure of the nsp9-C59A mutant. The crystals belong to space group 

201 P 432 i 2 . The structure is determined by molecular replacement and refined to 3.0 A 

202 resolution (Table 1). The crystal structure of the nsp9-C59A mutant monomer also 

203 shows 7 P-strands and an a-helix appended at the C-terminus (Fig. 5H). Neither a 

204 disulfide bond nor any analogous dimer interface to that formed by the disulfide bond 

205 is observed in the structure. The structural superposition of the wild-type nsp9 

206 monomer with the nsp9-C59A mutant monomer yields 94 Ca atoms with an r.m.s. 

207 deviation of 0.2 A (Fig. 5H). 

208 Taken together, the crystal structure and biochemical assays reveal the crucial role 

209 of Cys59 in PEDV nsp9 dimer formation, and the disulfide-linked homodimer is the 

210 primary form of the PEDV nsp9 dimer. However, PEDV nsp9 is monomer in cells and 

211 the disulfide-bonded nsp9 dimer might be an artifact in the infected cells. 

212 The importance of dimerization-related amino acids in nucleic acid binding of 

213 PEDV nsp9 

214 To test whether Cys59 and the glycines in the a-helix (Gly95, Gly99, Glyl02) are 

215 related to nucleic acid binding of PEDV nsp9, we designed three mutants, nsp9-C59A, 

216 nsp9-G95E/G99E/G102E and nsp9-C59A/G95E/G99E/G102E. Two EMSA assays 
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217 were performed to examine the nucleic acid binding affinity of each mutant. The 

218 ssDNA binding affinity of the nsp9-C59A mutant is somewhat weaker than that of the 

219 wild-type nsp9 protein, whereas the nsp9-G95E/G99E/G102E mutant shows very 

220 weak binding affinity, and the nsp9-C59A/G95E/G99E/G102E mutant shows almost 

221 no binding affinity (Fig. 6A, 6B). Furthermore, MST assays were performed to obtain 

222 quantitative measurements of the binding affinities. The results show that the 

223 wild-type PEDV nsp9 protein bound to ssDNA with a Kd of 145 liM, the nsp9-C59A 

224 mutant shows a 2.7-fold reduction in ssDNA binding, the nsp9-G95E/G99E/G102E 

225 mutant shows a 14-fold reduction, and the nsp9-C59A/G95E/G99E/G102E mutant 

226 shows much weaker binding affinity, with a 36-fold reduction (Fig. 6C). The results 

227 of the EMSA and MST assays indicate that Cys59 and the glycines in the a-helix of 

228 PEDV nsp9 play important roles in nucleic acid binding. 

229 The importance of positively charged amino acids in nucleic acid binding of 

230 PEDV nsp9 

231 The surface electrostatic potential analysis of PEDV nsp9 reveals a highly 

232 positively charged patch suggestive of a potential site for nucleic acid binding (Fig. 

233 7A). To determine whether this surface contributes to the ssDNA binding activity of 

234 PEDV nsp9, we introduced individual substitutions of positively charged residues, 

235 and their effects on ssDNA binding were assessed by EMSA and MST assays. In this 

236 study, residues LyslO, Arg68, Lys69 and Argl06 were mutated to alanine because of 

237 their highly positively charged potential and their exposure on the surface of the nsp9 

238 structure. One aromatic amino acid, Tyr82, was also mutated to alanine (Fig. 7A). The 
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239 EMSA results show that the nsp9-K10A, nsp9-R68A/K69A, and nsp9-R106A mutants 

240 show slightly weaker ssDNA binding activity, whereas the 

241 nsp9-K10A/R68A/K69A/R106A mutant reveals a significant loss of ssDNA binding 

242 affinity, and the nsp9-Y82A mutant shows clearly enhanced ssDNA binding affinity 

243 (Fig. 7B). The MST assay indicates that the nsp9-K10A and nsp9-R106A mutants 

244 have 1.3-fold and 2.7-fold reductions in ssDNA binding affinity compared with the 

245 wild-type protein, respectively, whereas the nsp9-K10A/R68A/K69A/R106A mutant 

246 shows much weaker binding affinity, with a 7.2-fold reduction, and nsp9-Y82A shows 

247 much stronger binding affinity, with an 8.0-fold increase (Fig. 1C). The ssDNA 

248 binding activity of the nsp9-R68A/K69A mutant could not be measured by the MST 

249 assay because of the protein’s sedimentation characteristics. The results of the EMSA 

250 and MST assays indicate that the positively charged surface of the PEDV nsp9 does 

251 indeed contribute to the ssDNA interaction. 

252 Discussion 

253 Diverse dimerization of coronavirus nsp9 

254 In the emerging CoV PDCoV, we first confirm that the N-finger motif is critical 

255 for dimerization. Moreover, the dimerization of PDCoV nsp9 is also supported by the 

256 parallel association of the C-terminal a-helix via strong hydrophobic interactions. 

257 Although the monomeric structures of nsp9 from PDCoV exhibited significant 

258 similarity to those from other CoVs, the dimerization modes showed high diversity. 

259 IBV nsp9 forms a homodimer via interactions across a hydrophobic interface, which 
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consists of two parallel alpha-helixes near the carboxy terminus of the protein (25). 
For the SARS-CoV nsp9 protein, the N-terminal extended |) chain and the C-terminal 
a-helices stabilize its interface (24). The N-terminal extended P chain of SARS nsp9 
is supported by the extended tag, which may lead to the unclear function of its 
N-finger motif. Here, SARS-CoV nsp9-AN7 mutant is monomeric in solution which 
is consistent with wild-type PDCoV nsp9-AN7 (data not shown). The N- and 
C-termini of the protein are more conserved than the central core region, and the 
GXXXG motif is strictly conserved (24, 26). In other proteins, mutation of the Gly 
residues in the motif led to the decreased or the complete loss of dimerization of 
transmembrane helices (21, 27, 28). The GXXXG motif of nsp9 is important for 
maintaining the dimer form. PDCoV nsp9 AN7 without the N-finger forms the 
monomeric stage. We conclude that the N-finger motif and the GXXXG motif are 
both important for dimerization of PDCoV nsp9. 

In the dimer interface of PEDV nsp9, similar with HCoV-229E nsp9, there is a 
disulfide bond. However, the cysteine is located at a different position, resulting in a 
very different dimer conformation. PEDV nsp9 primary forms a disulfide-linked 
homodimer, in which the two monomers are linked by the disulfide bond formed 
between the Cys59 residues. For HCoV-229E nsp9, a disulfide bond is involved in 
dimerization (23), which is consistent with our results. However, the weak effect of 
the C59A mutation on nucleic acid binding indicate that dimerization of PEDV nsp9 
by disulfide formation may not be a major form in the infected cells. Moreover, the 
overall milieu is reductive, and disulfide bonds are rare in the cytoplasm of the 
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282 infected cell (29). The biological function of the disulfide bond should be further 

283 investigated by a reverse genetics system. 

284 Nucleic acid binding by nsp9 

285 The crystal structures of CoV nsp9s share great similarity in the 

286 oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to 

287 ssDNA or ssRNA (30). Our data show that PDCoV nsp9 and PEDV nsp9 both have 

288 binding activity with nucleic acids, which is consistent with other CoV nsp9 proteins 

289 (21, 23, 25). In addition, the dimeric state of the PDCoV nsp9 and PEDV nsp9 can 

290 increase the binding activity with nucleic acids. Our EMSA and MST experiments 

291 also suggest that the N-finger of PDCoV nsp9 can influence the affinity to nucleic 

292 acids. In a previous study, the SARS-CoV nsp9 dimer was shown to be required for 

293 viral growth (21). The nsp9 dimerization is critical for the function of this replicase 

294 protein for the IBV infectious clone system (31). We hypothesize that the dimer of 

295 PDCoV nsp9 and PEDV nsp9 may be critical for viral replication. Whether the 

296 N-finger of PDCoV nsp9 or the cysteine of PEDV nsp9 affects viral replication 

297 should be further studied by using reverse genetic approaches. 

298 Nucleotides interact with proteins primarily via aromatic amino acid side chains, 

299 hydrophobic side chains or the aliphatic portions of polar and positively charged 

300 groups, such as lysine and arginine (30). A previous study of SARS-CoV nsp9 

301 suggested that the most likely site of RNA binding is on the loops L23, L45 and L7H1, 

302 as this face is accessible in the preferred helix-stabilized dimer but largely occluded in 
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303 the putative (i sheet-stabilized dimer (24). Based on the structure of SARS-CoV nsp9, 

304 we observe that positively charged amino acids are present on the loops. In our study, 

305 the PEDV nsp9-K10A, nsp9-R68A/K69A and nsp9-R106A mutants mildly affect 

306 ssDNA binding activity, whereas the nsp9-K10A/R68A/K69A/R106A mutant reveals 

307 a significant reduction in ssDNA binding (Fig. 7B, 7C), which confirmed the 

308 importance of highly positively charged residues exposed on the protein surface in 

309 ssDNA binding. Many amino acids in a single protein may participate in ssDNA 

310 binding, and thus, it is reasonable that a single-residue mutation could have a 

311 moderate effect on ssDNA binding activity. We hypothesize that the potential site of 

312 RNA binding is on the face of PEDV nsp9 present on strands pi and P6 (Fig. IB). 

313 Surprisingly, the nsp9-Y82A mutant strongly enhances the ssDNA binding affinity 

314 (Fig. 7B, 7C) and F6A and N8A of PDCoV nsp9 unexpectedly induced an 

315 enhancement of the ssDNA binding affinity, which may be caused by 

316 decreasing steric hindrance. Whether the nsp9-Y82A mutant affects viral replication 

317 should be further investigated. In addition, we have attempted to solve the structure of 

318 the nsp9-RNA or DNA complexes. Unfortunately, we were unable to obtain the 

319 crystals of the nsp9-RNA or DNA complexes. 

320 Nsp9 is able to bind ssDNA or ssRNA (22, 25), although binding of ssRNA is 

321 expected to be the native function. In the infected cell, the coupling/compartnrentation 

322 of the viral RNA synthesis with the RNA-binding function of nsp9 might render RNA 

323 versus DNA specificity unnecessary (22). Moreover, as a member of the replication 

324 complex, nsp9 may not have a specific binding sequence but may act in conjunction 
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with other nsps as a processivity factor (21). According to a model for the initiation of 
coronavirus negative-strand RNA synthesis, a protein complex (including nsp8 and 
nsp9) binds to viral RNA initially (32). The potential site of RNA binding may 
provide a new insight into the binding of the replication/transcription complexes to 
viral RNA. 

CoV nsp9 has diverse dimerization, while its function of enhancement nuclear 
binding affinity is conservative (22-25). We propose that most nsp9 proteins 
originated from ancestral coronavirus, which have N-finger motifs and GXXXG 
motifs that both play critical roles in dimerization. The N-terminal loops and GXXXG 
motifs of PDCoV, IBV and SARS-CoV nsp9 are also involved in dimerization, while 
the extra amino acids (the residues of fusion tag) in the N-finger may block the fully 
function of the N-finger (22, 24, 25). Interestingly, PEDV and HCoV-229E evolve 
different disulfide bonds to enhance the dimerization (23). Using diverse dimerization 
strategies, nsp9 might increase the nucleic acid binding interface and then promote its 
nucleic acid binding affinity, which might stabilize nascent viral RNAs during 
replication or transcription, thus providing protection from nucleases (22, 24). 

In summary, CoV nsp9s have diverse forms of dimerization that promote their 
biological function, which may help elucidate the mechanism underlying CoVs 
replication and contribute to the development of antiviral drugs. 


344 Materials and methods 

345 Protein production and crystallization 
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346 The full-length nsp9 genes of PDCoV and PEDV were amplified by PCR from 

347 virus-derived cDNA fragments and cloned separately into pET-42b. Both nsp9s were 

348 expressed with a C-terminal His^ tag in E. coli BL21 (DE3). Eligh-quality soluble 

349 PDCoV nsp9 protein and PEDV nsp9 protein were obtained by adding 1 mM IPTG 

350 (isopropyl-P-D-thiogalacto-pyranoside) to LB when the bacteria had grown to an 

351 optical density at 600 nm of approximately 0.6 at 37 °C, followed by shaking at 37 °C 

352 for 4 h and at 27 °C for 10 h, respectively. For protein purification, the E. coli cells 

353 were harvested, resuspended with phosphate-buffered saline (PBS), and lysed via 

354 passage through an AH-1500 homogenizer. After centrifugation at 8,500 rpm for 40 

355 min, the supernatant was filtered and loaded onto a HisTrap HP column (GE 

356 Healthcare). The PDCoV nsp9-AN7, PEDV nsp9-C59A and other nsp9 mutants were 

357 expressed and purified as described above. 

358 Crystallization screening was performed by the sitting drop vapor diffusion 

359 method at 20 °C. PDCoV nsp9 (5.0 mg/ml) was mixed at a 1:1 ratio with 

360 crystallization cocktails. Wild-type protein crystals and Se-Met-labeled nsp9 crystals 

361 appeared within 3 days at 20 °C in 2.0 M sodium chloride and 0.1 M citric acid (pH 

362 3.5). The PDCoV nsp9-AN7 mutant protein crystals appeared within 3 days at 20 °C 

363 in 0.2 M sodium chloride, 0.1 M Na 2 HP 04 :citric acid (pH 4.2) and 20% (w/v) PEG 

364 8000. After optimization, the wild-type crystals or Se-Met-labeled nsp9 crystals were 

365 stabilized in 3.0 M sodium chloride and 0.1 M citric acid (pH 3.5), and the nsp9-AN7 

366 crystal was stabilized in 0.1 M Na 2 HP 04 :citric acid (pH 4.2) and 22% (w/v) PEG 

367 8000. Meanwhile, PEDV nsp9 (6.5 mg/ml) was mixed at a 1:1 ratio with 
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368 crystallization cocktails. Wild-type protein crystals appeared within 40 days at 20 °C 

369 in 0.1 M Tris (pH 8.5) and 25% v/v tert-butanol, and PEDV nsp9-C59A mutant 

370 protein crystals appeared within 2 days at 20 °C in 0.2 M sodium thiocyanate (pH 6.9) 

371 and 12% PEG 3350. After optimization, the wild-type crystals were stabilized in 0.3 

372 M Tris (pH 8.5) and 13% v/v tert-butanol. and the nsp9-C59A mutant crystals were 

373 stabilized in 0.25 M sodium thiocyanate (pH 7.7), 10% PEG 3350, and 2% ethylene 

374 glycol. There was no reductant (DTT) in the crystallization set-up. 

375 Data collection and structure determination 

376 X-ray diffraction data were collected at beamline BL17U at the Shanghai 

377 Synchrotron Radiation Facility. The structure of PDCoV nsp9 was solved using the 

378 single-wavelength anomalous dispersion (SAD) method and a Se-Met-derivative nsp9 

379 wild-type protein. Both potential selenium atoms in the nsp9 monomer were located, 

380 and the initial phases were calculated using the AutoSol program from the PHENIX 

381 software suite (33). The molecular replacement program HKL-3000 (34) was used to 

382 solve the structure of PEDV nsp9, with a monomer of HCoV-229E nsp9 (PDB 

383 ID:2J97) as the search model. Manual model rebuilding was performed using COOT 

384 (35) and then refined in the PHENIX software suite. The figures were created using 

385 PyMOL (Schrodinger). 

386 Structural analysis and sequence alignment 

387 Detailed molecular interactions between the two monomers of nsp9 were 

388 determined using LIGPLOT (36), and the other structure figures were generated using 
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389 PyMOL (Schrodinger). The buried surface areas between the two monomers and the 

390 root mean square deviation (RMSD) were analyzed using PDBePISA 

391 (http://pdbe.org/pisa/) and PDBeFold (http://pdbe.org/fold/), respectively. Additionally, 

392 the amino acid sequences of coronavirus nsp9 were aligned using ClustalW2 (37) and 

393 visualized with the ESPript 3 server ( http://espript.ibcp.fr ) (38). The analyzed viruses 

394 (abbreviation; NCBI accession number) were as follows: PEDV (porcine epidemic 

395 diarrhea virus; AHZ94880), PDCoV (porcine delta coronavirus; APG38197), TGEV 

396 (transmissible gastroenteritis virus; AML22776), HCoV-229E (human coronavirus 

397 229E; NP_073550), SARS-CoV (SARS coronavirus Shanhgai LY; AAP82976), 

398 MERS-CoV (middle east respiratory syndrome-related coronavirus; AHI48749), 

399 ME1V (murine hepatitis virus strain A59; AC072881), IBV (Infectious bronchitis 

400 virus; ACJ12833). 

401 Size-exclusion chromatography (SEC) 

402 Oligomerization of wild-type (0.1 mg) and mutant (AN7 and 

403 L4A/L6A/R7A/N8A) (0.1 mg) PDCoV nsp9 proteins was analyzed using a Superdex 

404 75 10/300GL column (GE Healthcare) with a buffer containing 20 mM Tris-HCl (pH 

405 7.4) and 200 mM NaCl at a flow rate of 0.6 ml/min (4 °C). Wild-type and mutant 

406 (AN7 and L4A/L6A/R7A/N8A) nsp9 proteins eluted in different fractions were 

407 analyzed by SDS-PAGE. 

408 Immunoblotting 

409 To obtain high expression in eukaryotic cells, PEDV nsp9 wild-type, flanked by 
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430 


an C-terminal hemagglutinin (HA) tag, was cloned into pCAGGS vector using the 
EcoRI and Xhol restriction sites. 

HEK293T cells cultured in 6-well plates (Corning, Tewksbury, MA, USA) were 
transfected with an empty vector or wild-type PEDV nsp9 plasmids (2.5 pg) using 
Lipofectamine 2000. At 48 h post-transfection, cells were treated with RIPA lysis 
buffer (Beyotime). Then the lysates were treated with SDS-PAGE loading buffer with 
or without DTT. For the immunoblot analysis, 10 pi of lysate was electrophoresed on 
12% SDS-PAGE gels (Bio-Rad). Proteins were transferred to PVDF membranes 
(Bio-Rad) according to the manufacturer's recommendations. Membranes were 
incubated in blocking buffer (5% nonfat dried milk in TBS-Tween [50 mM Tris, pH 
7.6, 150 mM NaCl]) for 3h at room temperature (RT). Anti-HA antibody (Ab; MBL) 
was diluted 1:10,000 in blocking buffer and incubated with the membrane for 3h at 
RT. Membranes were washed three times for 15min at RT with TBS-Tween. Goat 
anti-mouse horseradish peroxidase-conjugated antibodies (BOSTER) were diluted 
1:5,000 in blocking buffer and incubated with the membrane for lh at RT. Membranes 
were washed three times for 15 min at RT with TBS-Tween. Then, the membranes 
were visualized using an enhanced chemiluminescence system (Amersham Imager 
600, GE Healthcare). 

Analytical ultracentrifugation (AUC) 

Sedimentation velocity was assessed with an XL-I analytical centrifuge (Proteome 
Lab, Huazhong Agricultural University, Wuhan) at 60,000 rpm and 18 °C in 400 pi 
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431 double sector cells for sedimentation analytical ultracentrifugation. The sedimentation 

432 boundary was monitored every minute at a wavelength of 280 nm. Samples were at a 

433 concentration of 1 mg/ml in 20 mM Tris, 200 mM NaCl (pH 7.4) buffer. Data were 

434 interpreted with the model-based distribution of Lamm equation solutions c(s) using 

435 the software Sedfit (39). Weight-averaged molar masses were calculated using Vector 

436 NTI software. 

437 Electrophoretic mobility shift assay (EMSA) 

438 An electrophoretic mobility shift assay was performed as described previously (40). 

439 A 5'-Cy5-labeled 36-mer ssDNA oligonucleotide 

440 (TAGTACCGCCACCCTCAGAACCTTTTTTTTTTTTTT) (synthesized by Sangon 

441 Biotech, Shanghai, China) was used for the gel shift assay (22). Initially, 5 liM ssDNA 

442 was incubated with different concentrations of PDCoV and PEDV nsp9s and the 

443 mutant proteins in 10 mM Tris (pH 8.0) and 200 mM NaCl for 45 min at room 

444 temperature, followed by the addition of 10% glycerol to the mixture. Samples were 

445 then run on 6.5% nondenaturing TBE polyacrylamide gels for 37 min at a voltage of 

446 180V, and the results were determined with an FLA-2000 fluorescent image analyser 

447 (Fuji, Stamford, CT). 

448 Zone-interference gel electrophoresis was described in the publication by 

449 Ponnusamy et al (23). The ssDNA was dissolved at a concentration of 2 nmol before 

450 loading onto a 1% agarose gel. The gel was run for 50 min with running buffer (20 

451 mM Tris, 50 mM boric acid, 1 mM NaEDTA, pH 8.3) at 100 mA, 4°C. With the poles 
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468 


of the electrodes interchanged, 1 nmol nsp9 protein was loaded and run for another 50 
min. After electrophoresis, the gel was fixed in 3.5% (w/v) a-sulfosalicilic acid, 10% 
(w/v) trichloroacetic acid, until the dye turned yellow. For the detection of protein 
bands, the gel was washed for 15 min in 15% (v/v) ethanol, 8% (v/v) acetic acid, and 
stained for 30 min with 0.25% (w/v) Coomassie brilliant blue in the same solution 
with additional 10% (v/v) methanol. The gel was washed in 15% (v/v) ethanol, 8% 
(v/v) acetic acid, and stored in 10% (v/v) acetic acid. 

Microscale thermophoresis (MST) assay 

Specific binding between nsp9 or the nsp9 mutant proteins and ssDNA was 
measured by MST assays as previously described (41, 42). Briefly, nsp9 or nsp9 
mutant proteins were serially diluted with buffer containing 10 mM Tris (pH 8.0) and 
200 mM NaCl, and then, 10 nM Cy5-labeled ssDNA diluted with buffer containing 10 
mM Tris (pH 8.0), 200 mM NaCl and 0.05% Tween-20 was added. The mixtures were 
loaded into NT.115 standard capillaries (Nanotemper Technologies), and 
thermophoresis was carried out at room temperature, 100% LED, and 20% IR-laser 
power with a Monolith NT.115 (Nanotemper Technologies). The data were analyzed 
with the Nanotemper Analysis software v.1.2.101. 


469 Accession numbers 

470 The atomic coordinates of wild-type PDCoV nsp9, PDCoV nsp9-AN7, wild-type 

471 PEDV nsp9 and the nsp9-C59A mutant have been deposited in the RCSB Protein 

472 Data Bank with the accession codes 5YM6, 5YM8, 5HIZ and 5HIY, respectively. 
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610 Figure legends 

611 Figure 1. Stereoview of the structures of PDCoV nsp9 and PEDV nsp9. (A) 

612 Ribbon representation of the PDCoV nsp9 monomer. (B) Ribbon representation of the 

613 PEDV nsp9 monomer. (C) Schematic illustration of nsp9 topology. The (J strands are 

614 drawn as arrows and a-helices as cylinders. (D) Sequence alignment of CoV nsp9 

615 homologs to PDCoV nsp9 and PEDV nsp9. Identical residues are highlighted in red, 

616 and conserved residues are shown in yellow. The table was produced using ESpript3.0 

617 with the secondary structure elements for PDCoV nsp9 and PEDV nsp9 assigned 

618 using DSSP (43). Residues boxed in red are completely conserved. 

619 Figure 2. The N-finger of the PDCoV nsp9 has a strong effect on dimerization. (A) 

620 The two monomers are colored cyan and magenta, respectively, and depicted in 

621 cartoon and surface representations. (B) 2Fo - F electron density map of the N-finger 
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622 at 1.5a. This region includes ten residues. (C) Dimerization interface of PDCoV nsp9. 

623 The interacting residues (Leu4, Arg7, Arg70, Gly96, GlylOO and Serl03) between 

624 two monomers are shown with stick illustrations. (D) All the interacting residues 

625 between subunits A (blue) and B (magenta) were determined using LIGPLOT. (E) 

626 Calculated molecular weights of the PDCoV nsp9 protein peaks with the values 

627 obtained for known calibration standards (GE Healthcare). The calculated molecular 

628 weight of the PDCoV nsp9 peaks was determined by fitting to the calibration curve 

629 (K av =volumes of elution [V es ]/24); the volumes of elution of 10.87 ml (approximately 

630 25.33) and 12.49 ml (approximately 12.64) are indicated by arrows. (F) Velocity AUC 

631 analysis of the PDCoV nsp9 and mutant proteins. The molecular mass represented by 

632 each major peak is indicated, and the biophysical data are shown in the table. (G) Size 

633 exclusion experiment with the wild-type and mutants (AN7 and L4A/L6A/R7A/N8A) 

634 of PDCoV nsp9. (H) SDS-PAGE analysis of wild-type and mutant 

635 (L4A/L6A/R7A/N8A and AN7) nsp9. The elution volume is described in panel E. 

636 Molecular mass markers are shown. 

637 Figure 3. Illustration of the superimposition of PDCoV nsp9-WT and PDCoV 

638 nsp9-AN7 monomer structures. The PDCoV nsp9-WT and PDCoV nsp9-AN7 

639 monomers are colored orange and blue, respectively, and depicted in cartoon 

640 representation. The structural superposition shows a root mean square deviation 

641 (RMSD) of 0.9 A between the 90 Ca atoms. 

642 Figure 4. The N-finger motif can affect the ssDNA binding abilities of PDCoV 

643 nsp9. (A) The ssDNA binding abilities of nsp9-WT, nsp9-AN7, nsp9-G96E, 
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644 nsp9-G100E, nsp9-L4A, nsp9-L6A, nsp9-R7A, nsp9-N8A and 

645 nsp9-L4A/L6A/R7A/N8A as determined by EMSA. Each protein was assayed at 

646 different concentrations (0 pM, 25 pM, 50 pM) with 5 pM 5'-Cy5-labeled 36-mer 

647 ssDNA. (B) Zone-interference gel electrophoresis illustrating the association of 

648 nsp9-WT, nsp9-AN7 and nsp9-L4A/L6A/R7A/N8A with 36-mer ssDNA. The red 

649 dotted lines represent the position of the bands of nsp9-WT, nsp9-AN7 and 

650 nsp9-L4A/L6A/R7A/N8A without reaction with ssDNA. (C) The ssDNA binding 

651 affinities of nsp9-WT and mutants were measured by MST assays. Each protein was 

652 assayed in twofold concentration steps with 10 nM 5'-Cy5-labeled 36-mer ssDNA. 

653 The measured Kd of each protein is shown in the table. 

654 Figure 5. Two potential types of PEDV nsp9 dimers and the crucial role of 

655 cysteine. (A) Orthogonal views of the helix dimer. The two monomers are colored 

656 yellow and light blue and are depicted in cartoon and surface representations. (B) The 

657 heart of the dimer interface is formed from three glycines (Gly95, Gly99 and Glyl02). 

658 (C) Orthogonal views of the disulfide-linked dimer. The two monomers are colored 

659 purple and split-pea and are depicted as cartoon and surface representations. (D) The 

660 disulfide bond is depicted as a magenta dotted line and the hydrogen bonds as red 

661 dotted lines, and residues located in the dimer interface are labeled in green. (E) 

662 Velocity AUC analysis of the PEDV nsp9 and nsp9-C59A mutant proteins. The 

663 molecular mass represented by each major peak is indicated, and the biophysical data 

664 are shown in the table. (F) SDS-PAGE analysis of freshly prepared proteins treated 

665 with 5 X SDS-PAGE loading buffer without DTT. The differences between nsp9 and 


D 

o 


Downloaded from http://jvi.asm.org/ on June 22, 2018 by Kaohsiung Medical University 





cfWology JVI Journal of Virology Accepted MdnUSCiTpFPosted Oflllhe 


666 the nsp9-C59A mutant proteins were detected on polyacrylamide gels. (G) HEK293T 

667 cells were transfected with an empty vector or wild-type PEDV nsp9 plasmids. At 48h 

668 post-transfection, the cell lysates were treated with SDS-PAGE loading buffer with or 

669 without DTT and subjected to Western blotting. (H) Illustration of the superimposition 

670 of PEDV nsp9 and PEDV nsp9-C59A monomeric structures. 

671 Figure 6. The disulfide bond can affect the ssDNA binding abilities of PEDV nsp9. 

672 (A) The ssDNA binding abilities of nsp9-WT, nsp9-C59A, nsp9-G95E/G99E/G102E, 

673 and nsp9-C59A/G95E/G99E/G102E as determined by EMSA. Each protein was 

674 assayed at different concentrations (0 pM, 0.41 pM, 0.82 pM, 1.64 pM, 3.28 pM, 6.56 

675 pM, 10 pM) with 5 pM 5'-Cy5-labeled 36-mer ssDNA. (B) Zone-interference gel 

676 electrophoresis illustrating the association of nsp9-WT, nsp9-C59A and nsp9- 

677 C59A/G95E/G99E/G102E with 36-mer ssDNA. The red dotted lines represent the 

678 position of the bands of nsp9-WT, nsp9- C59A and nsp9- C59A/G95E/G99E/G102E 

679 without reaction with ssDNA. (C) The ssDNA binding affinities of nsp9-WT, 

680 nsp9-C59A, nsp9-G95E/G99E/G102E, and nsp9-C59A/G95E/G99E/G102E were 

681 measured by MST assays. Each protein was assayed in twofold concentration steps 

682 with 10 nM 5'-Cy5-labeled 36-mer ssDNA. The measured K d of each protein is 

683 shown in the table. 

684 Figure 7. Impact of positively charged amino acid substitutions on ssDNA 

685 binding of PEDV nsp9. (A) Depictions of the electrostatic potential surface of the 

686 PEDV nsp9 dimer created by APBS tools. The mutated amino acids exposed on the 

687 protein surface are labeled. (B) The ssDNA binding abilities of nsp9-WT, nsp9-K10A, 
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688 nsp9-R68A, K69A, nsp9-R106A, nsp9-K10A/R68A/K69A/R106A and nsp9-Y82A as 

689 determined by EMSA. Each protein was assayed at different concentrations (0 pM, 

690 0.41 pM, 0.82 pM, 1.64 pM, 3.28 pM, 6.56 pM, 10 pM) with 5 pM 5'-Cy5-labeled 

691 36-mer ssDNA. (C) The ssDNA binding affinities of nsp9-WT, nsp9-K10A, 

692 nsp9-R106A, nsp9-K10A/R68A/K69A/R106A and nsp9-Y82A as measured by MST 

693 assays. Each protein was used in twofold concentration steps with 10 nM 

694 5'-Cy5-labeled 36-mer ssDNA. The measured K d of each protein is shown in the 

695 table. 
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Table 1 Statistics of data collection and refinement 


Value(s) 0 


Wild-typePDCoVnsp9 

PDCoV nsp9-AN7 

Wild-typePEDVnsp9 

PEDVnsp9-C59A 

Data collection 





Space group 

P2i 

P2i 

P 64 

P4 3 2i2 

Cell parameter [a, b, c (A)] 

57.97,55.19,69.44 

31.55,83.86,36.34 

73.87,73.87.91.55 

58.73, 58.73, 193.24 

ega>a, p, y(°) 

90.00,90.01, 90.00 

90.00,106.08,90.00 

90.00,90.00,120.00 

90.00, 90.00, 90.00 

Wavelength (A) 

0.97917 

0.97917 

0.97917 

0.97917 

Resolution (A) (range) 

29.79- 1.80 

34.92-1.99 

28.75 -2.89 

38.15-3.00 

Completeness (%) 

97.8(97.0) 

99.7(100.0) 

99.4 (99.2) 

99.9(100.0) 

Emerge 6 ! %) 

6.4(46.6) 

7.5(10.3) 

6.3(67.7) 

5.9(84.9) 

//a(last shell) 

15.71(3.38) 

24.51(23.27) 

35.33(3.00) 

29.8(3.74) 

Redundancy (last shell) 

3.8(3.8) 

7.0(7.2) 

11.1(11.4) 

7.6(7.7) 

Refinement 





Resolution (A) (range) 

29.79- 1.80 

34.92-2.00 

28.75-2.90 

38.15-3.00 

f?work/^£ree C (%) 

20.6/24.6 

21.3/24.8 

30.7/32.7 

29.5/33.5 

No. reflections 

39706 

12364 

6340 

6925 

No of protein atoms 

2970 

1463 

1498 

2249 

No. of solvent atoms 

207 

70 

0 

0 

No. of molecules 

4 

2 

2 

3 

RMSD 





Bond length (A) 

0.007 

0.009 

0.009 

0.005 

Bond angle (°) 

1.018 

1.070 

1.577 

1.348 

Avg B factor (A 2 ) 

33 

20 

109 

103 

Ramachandranplotxore,allow, 
disallow (%) 

97.29,2.71,0.00 

97.81,2.19,0.00 

92.22, 6.67,1.11 

95.40, 3.83,0.77 
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"Highest resolution values are written in parenthesis. 

b R mt rge = 2 S 7j-<I>/XE/j (where is Ii the intensity measurement of reflection h and <I> is 
theaverage intensity from multiple observations). 

V?work = £ ||F 0 I - IF C || / S IFqI; where F 0 and F c are the observed and calculated structure 
factorsrespectively; /?f ree is equivalent to 7? W ork but where 5% of the measured reflections have been 
excluded fromrefinement and set aside for cross-validation. 
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0.7-. 



y=-0.222x+0.7646 

0.6- 



R ! =0.9895 



(y=Kav ; x=lg MW) 


Monomer 



0.4 - 





(12.49m L,12.64kDa) 



0.3- 

Dimer 



0.2 - 

(10.87m L,25.33kDaT" 



0.1 - 




0.0- 






Predicted MW (kDa) 


AUC 


Construct 

Mononmer 

Dimer 

m 

Sio.»(S) 

MW(kDa) 





(monomer/dimer) 

(monomer/dime 

nsp9-WT 

11.99 

23.98 

1.423 

1.369/2.058 

12.56/23.14 

nsp9-AN7 

11.22 

23.98 

1.317 

1.349/— 

10.94/— 

N-finger nsp9-R7A 

11.99 

23.98 

1.373 

1.396/1.961 

12.26/20.41 

nsp9-L4A/L6A/R7A/N8A 

11.99 

23.98 

1.368 

1.316/2.011 

11.16/21.07 

o-hclical nsp9-G96E 

11.99 

23.98 

1.368 

1.349/1.963 

11.58/20.32 

nsp9-G100E 

11.99 

23.98 

1.367 

1.348/2.024 

11.56/21.26 


G 


0.7 0.8 0.9 1.0 1.1 1.2 IJ 1.4 1.5 1.6 1.7 1.8 1.9 2.0 

Ig Molecular Weight(MW) 
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15 16 mL 
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WT 


AN7 


G96E 


G100E 


50 25 0 50 25 0 50 25 0 50 25 0 nM 



L4A/L6A/R7A/N8A 


50 25 0 50 25 0 50 25 0 50 25 0 50 25 0 jiM 


m m—— m w 






c° 




//// /V 

^ ^ ^ ^ Q 





MST 

PDCoY lisp 9 mutants 

Kd(wM) 

SD 

WT 

410 

13.7 

AN 7 

794 

19.5 

G96E 

771 

4.0 

G100E 

1140 

21.0 

L4A 

560 

262 

L6A 

291 

8.2 

R7A 

861 

8.4 

N8A 

120 

0.6 

L4A/L6A/R7A/N 8A 

1530 

17.2 
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Predicted MW (kDa) 

AUC 

Construct 

Mononmcr 

Dimer 

m 

S20.w(S) 

(monomer/dimer) 

MW(kDa) 

(monomer/dimer) 

nsp9-WT 

12.09 

24.18 

1.421 

1.489/1.927 

14.22/20.92 

nsp9-C59A 

12.09 

24.18 

1.393 

1.372/— 

12.2/— 



■ PF.DV nsp» 
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- 


«- W — M h* 


m 


,/ C? x* , 

V ** 


,0° « x 

# J? 

? J* 



MST 

PEDV nsp9 mutants 

Kd(aM) 

SD 

WT 

145 

7.1 

C59A 

396 

11.0 

G95E/G99E/G102E 

2050 

42.6 

C59 A/G95E/G99E/G102 

5240 

106 
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A 



VVT K10A R68A/K69A R106A K10A/R68A/K69A/R106A Y82A 



c 


MST 

PEDV nsp9 mutants 

Kd(uM) 

SD 

WT 

145 

7.1 

K10A 

182 

6.8 

R106A 

390 

3.3 

K10 A/R68A/K69 A/R106 A 

1050 

17.0 

Y82A 

18 

0.3 
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